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■ Abstract 

' Motivated by applications in storage systems and property testing, we study data stream 

. algorithms for local testing and tolerant testing of codes. Ideally, we would like to know whether 

there exist asymptotically good codes that can be local/tolerant tested with one-pass, poly-log 
space data stream algorithms. 

We show that for the error detection problem (and hence, the local testing problem), there 
, exists a one-pass, log-space data stream algorithm for a broad class of asymptotically good 

codes, including the Reed-Solomon (RS) code and expander codes. In our technically more 
involved result, we give a one-pass, 0(e log^ n)-space algorithm for RS (and related) codes with 
dimension k and block length n that can distinguish between the cases when the Hamming 
distance between the received word and the code is at most e and at least a ■ e for some absolute 
constant a > 1. For RS codes with random errors, we can obtain e ^ 0{n/k). For folded RS 
, codes, we obtain similar results for worst-case errors as long as e ^ {n/kY^^ for any constant 

'sj" ' £ > 0. These results follow by reducing the tolerant testing problem to the error detection 

, problem using results from group testing and the list decodability of the code. We also show 

' that using our techniques, the space requirement and the upper bound of e ^ 0{n/k) cannot 

, be improved by more than logarithmic factors. 

•r-j , 1 Introduction 

X 

■ In this work, we consider data stream algorithms for local testing and tolerant testing of error- 
correcting codes. The local testing problem for a code C Q is the following: given a received 
word y G S", we need to figure out if y € C or if y differs from every codeword in C in at least 
< e ^ n positions (i.e. the Hamming distance of y from every c £ C, denoted by A(y, c), is at 
least e). If e = 1, then this is the error-detection problem. In the tolerant testing problem, given 
y, we need to decide whether y is at a distance at most ei from some codeword or if it has distance 
at least 62 > ei from every codeword in C. Ideally, we would like to answer the following (see 
Section [2] for definitions related to codes): 

Question 1. Do there exist asymptotically good codes that can be (tolerant) tested by one-pass, 
poly-log space data stream algorithms? 
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To the best of our knowledge, ours is the first work that considers this natural problem. We 
begin with the motivation for our work. 

Property Testing. Local testing of codes has been extensively studied under the stricter require- 
ments of property testing. Under the property testing requirements, one needs to solve the local 
testing problem by (ideally) only accessing a constant number of positions in y. Codes that can be 
locally tested with a constant number of queries have been instrumental in the development of the 
PCP machinery, starting with the original proof of the PCP theorem [21 14]. The current record is 
due to Dinur, who presents codes that have inverse poly-log rate, linear distance and can be locally 
tested with a constant number of queries |12j . 

General lower bounds on local testing of codes, however, have been scarce. (See, e.g. the recent 
paper [7]). In particular, it is not known if there are asymptotically good codes that can be locally 
tested with a constant number of queries. The question remains open even if one considers the 
harder task of tolerant testing with a constant number of non-adaptive queries |16] . 

It is not too hard to see that a non-adaptive tolerant tester that makes a constant number of 
queries gives a single pass, log-space data stream algorithm for tolerant testing. (See Section [3] for a 
proof.) Thus, if one could prove that any one-pass data stream algorithm for local/tolerant testing 
of asymptotically good codes requires a;(logn) space, then they will have answered the question in 
the negative (at least for non-adaptive queries). This could present a new approach to attack the 
question of local/tolerant testing with a constant number of queries. 

Next, we discuss the implications of positive results for local/tolerant testing. 

Applications in Storage Systems. Codes are used in current day storage systems such as 
optical storage (CDs and DVDs), RAID ([H]) and ECC memory ([TO])- Storage systems, up until 
recently, used simple codes such as the parity code and checksums, which have (trivial) data stream 
algorithms for error detection. However, the parity code cannot detect even two errors. With the 
explosion in the amount of data that needs to be stored, errors are becoming more frequent. This 
situation will become worse as more data gets stored on disks [H]. Thus, we need to use codes 
that can handle more errors. 

Reed-Solomon (RS) codes, which are used widely in storage systems (e.g. in CDs and DVDs 
and more recently in RAID), are well-known to have good error correcting capabilities. However, 
the conventional error detection for RS codes is not space or pass efficient. Thus, a natural question 
to ask is if one can design a data stream algorithm to perform error detection for RS codes. 

It would be remiss of us not to point out that unlike a typical application of a data stream 
algorithm where n is very large, in real life deployments of RS codes, n is relatively small. However, 
if one needs to implement the error detection algorithm in controllers on disks then it would be 
advantageous to use a data stream algorithm so that it is feasible to perform error detection with 
every read. Another way to use error detection is in data scrubbing |14] . In this scenario, during idle 
time or low activity periods, error detection is run on the entire disk to catch errors. In addition, 
the single pass requirement means that we will probe each bit on a disk only once, which is good 
for longevity of data. Finally, it would be helpful to complement an error-detection algorithm with 
a data stream algorithm that could also locate the errors. 

It is also plausible that the efficiency of the data stream algorithms will make it feasible to use 
RS codes of block length (and alphabet size) considerably larger than the ones currently used in 
practice. 

Before we delve into the description of our results, we would like to point out a few things. 
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First, for the storage application, designing algorithms for a widely used code such as the RS code 
will be more valuable than answering Question [T] in the affirmative via some new code. Second, 
it is known that for local testing of a RS code of dimension k, at least k queries need to be made 
in the property testing worlcQ- However, this does not rule out the possibility of a local/tolerant 
tester in the data stream world. Finally, for storage systems, solving the tolerant testing problem 
even for large constants ei and 62 would be interesting. 

Our Results. We give a one-pass, poly-log space, randomized algorithm to perform error detection 
for a broad class of asymptotically good codes such as Reed-Solomon (RS) and expander codes. 
As a complementary result, we also show that deterministic data stream algorithms (even with 
multiple passes) require linear space for such codes. Thus, for local testing we answer Question[T]in 
the affirmative. This should be contrasted with the situation in property testing, where it is known 
that for both asymptotically good RS and expander codes, a linear number of queries is required. 
(The lower bound for RS codes was discussed in the paragraph above and the result for expander 
codes follows from [8].) 

It turns out that using existing results for tolerant testing of Reed-Muller codes over large 
alphabets [H], one can answer Question [T] in the affirmative for tolerant testing, though with 
0{n^) space for any constant e > 0. (See Section [3.21 for more details.) 

Given the practical importance of RS codes and given the fact that local testing for RS codes 
with data stream constraints is possible, for the rest of the paper we focus mostly on tolerant testing 
of RS and related codes. We first remark that a naive tolerant testing algorithm for RS codes that 
can be implemented in O(elogn) space is to go through all the X]i=i (^) possible error locations S 
and check if the received word projected outside of S belongs to the corresponding projected down 
RS code. (This works as long as e ^ n — A;, which is true w.l.o.g. since n — k \s the covering radius 
of a RS code of dimension k and block length n.) Using our error detection algorithm for RS codes, 
this can be implemented as a one-pass 0(elogn)-space data stream algorithm. Unfortunately, the 
running time of this algorithm is prohibitive, even for moderate values of e. 

In this paper, we match the parameters above to within a log factor but with a (small) polyno- 
mial running time for values of e much larger than a constant. In particular, we present a one-pass, 
0(elog^ n)-space, polynomial time randomized algorithm for a RS code C with dimension k and 
block length n that can distinguish between the cases when the Hamming distance between y and C 
is at most e and at least a-e (for some constant a > 1). This reduction works when e{e + k) ^ 0(n). 
If we are dealing with random errors, then we can obtain ek ^ 0(n). Using known results on list 
decodability of folded RS codes [17], we obtain similar results for worst case errors for e ^ (n/A;)^~^ 
for any constant e > 0. As a byproduct, our algorithms also locate the errors (if the number of 
errors is bounded by e), which is desirable for a storage application. We also show that using our 
techniques, the space requirement and the upper bound of e ^ 0{n/k) cannot be improved by 
more than logarithmic factors. 

Ideally, we would like our data stream algorithms to spend poly-log time per input position. 
However, in this paper we will tolerate polynomial time algorithms. In particular, naive implemen- 
tations of the tolerant testing algorithms take O(n^) time. We also show that at the expense of 
slightly worse parameters, we can achieve a running time of 0{ne) for certain RS codes. 

Our Techniques. It is well-known that error detection for any linear code can be done by checking 
if the product of the received word with the parity check matrix is the all zeros vector. We turn this 

^This follows from the fact that the "dual" code has distance k + 1. 
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into a one-pass low space data stream algorithm using the well-known finger printing method. The 
only difference from the usual fingerprinting method, where one uses any large enough field, is that 
we need to use a large enough extension field of the finite field over which the code is defined. To 
show the necessity of randomness, we use the well-known fooling set method from communication 
complexity |19| . However, unlike the usual application of two-party communication complexity in 
data stream algorithms, where the stream is broken up into two fixed portions, in our case we need 
to be careful about how we divide up the input. Details can be found in Section [5l 

We now move on to our tolerant testing algorithm. We begin with the connection to group 
testing. Let c be the closest codeword to y and let x G {0, 1}" denote the binary vector where 
= 1 iff T/j 7^ Cj. Now assume we could access x in the following manner: pick a subset Q C [n] 
and check if xq = or not (where xq denotes x projected onto indices in Q). Then can we come 
up with a clever way of non-adaptively choosing the tests such that at the end we know whether 
Wt(x) ^ e or not? It turns out that we can use group testing to construct such an algorithm. 
In fact, using e-disjunct matrices (cf. |13j). we can design non- adaptive tests such that given the 
answers to the tests one could compute x if wt(x) ^ e, else determine that wt(x) > e. (A natural 
question is how many tests do e-disjunct matrices require: we will come back to this question in a 
bit.) This seems to let us test whether y is within a Hamming distance of e from some codeword or 
not. We would like to point out that the above is essentially reducing one instance of the tolerant 
testing problem to multiple instances of error-detection. 

Thus, all we need to do is come up with a way to implement the tests to x. A natural option, 
which we take, is that for any test Q ^ [n], we check if yq S RSq[A;], where RSq[/c] is the RS code 
(of dimension k) projected onto Q. This immediately puts in one restriction: we will need \Q\ ^ k 
(as otherwise every test will return a positive answer). However, there is another subtle issue that 
makes our analysis more complicated- we do not necessarily have that xq = iff G RSq[/c]. 
While it is true that = implies yg G RSqlk], the other direction is not true. The latter is 
possible only if y agrees with some codeword c? 7^ c in the positions indexed by Q. Now if s is the 
size of the smallest test and it is the case that the only codeword that agrees with y in at least s 
positions is c, then we'll be done. We show that this latter condition is true for RS codes if s ^ e + k 
for worst-case errors or with high probability if s ^ Ak for random errors. 

It is now perhaps not surprising that the list decodability of the code plays a role in our general 
result for worst-case errors. Assume that the code C under consideration is {n — s,L) list decodable 
(i.e. every Hamming ball of radius n — s has at most L codewords in it) and one can do error 
detection on C projected down to any test of size at least s. If we pick our disjunct matrix carefully 
and L is not too large, it seems intuitive that one should be able to have, for most of the tests, 
that Xq ^ implies yq ^ Cq. We are able to show that if the matrix is picked at random, then 
this property holds. In addition, it is the case that the "decoding" of x from the result of the test 
can be done even if some of the test results are faulty (i.e. yq G Cq even though xq ^ 0). The 
proof of this fact requires a fair bit of work: we will come back to the issues in a bit. 

Another side-effect of the fact that our algorithm does not readily translate into the group 
testing scenario is that even though we have been able to salvage the case for wt(x) ^ e, we can 
no longer guarantee that if wt(5;) > e, that our algorithm will catch it. In the latter case, our 
algorithm might say wt(x) > e or it might return a subset S C [n] that purportedly contains all the 
error locations. However, we can always check if y[n]\s ^ R'S[,„]\5[/c] to rule out the latter case. This 
seems to require another pass on the input but we are able to implement the final algorithm in one 
pass by giving a one-pass algorithm for the following problem: Given as input y followed by T C [n] 
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such that |r| = e, design a one-pass O(elogn) space algorithm to check if y[n]\T ^ R'Sj„]\y[A;]. The 
main idea is to encode the locations in T as an unknown degree e polynomial and to fill in the 
unknown coefficients once the algorithm gets to T in the input. 

We now return to the question of how many tests we can get away with while using e-disjunct 
matrices. The best known construction uses O(e^logn) tests [13] and this is tight to within a 
loge factor (cf. [15]). Thus, to get sublinear space, we need to have e = o{^/n). To break the 
^/n barrier, instead of e-disjunct matrices, we use the recently discovered notion of the (e, e)-list 
disjunct matrix [18] . An (e, e)-list disjunct matrix has the property that when applied to x such that 
wt{x) ^ e, it returns a subset S C [n] such that (i) Xi = 1 implies i £ S and (ii) \S\ ^ wt(5;) + e. It 
is known that such matrices exist with 0{e log n) rows. In Section [8l we show that such matrices can 
be constructed with O(elog^n) random bits. However, note we can now only distinguish between 
the cases of Wt(x) ^ e and wt(x) ^ 2e. 

The use of list disjunct matrices also complicates our result for worst case errors that uses 
the list decodability of the code under consideration. The issue is that when we pick the desired 
matrix at random, with the extra task of "avoiding" all of the L — 1 codewords other than c that 
can falsify the answer to the test, we can only guarantee that the "decoding" procedure is able to 
recover a constant fraction of the positions in error. This is similar to the notion of error reduction 
in |23j . This suggests a natural, iterative 0(loge)-pass algorithm. Using our earlier trick, we can 
again implement our algorithm in one pass. Finally, the plain vanilla proof needs Q{n) random 
bits. We observe that the proof goes through with limited independence and use this to reduce the 
amount of randomness to O(e^log^n) bits. Reducing the random bits to something smaller, such 
as O(elogn), is an open problem. 

The speedup in the runtime from the naive O(n^) to 0{ne) for the tolerant testing algorithms 
is obtained by looking at certain explicit disjunct matrices and observing that the reduced error 
detection problems are nicely structured. 

There are two unsatisfactory aspects of our algorithms: (i) The 0(e log^ n) space complexity and 
(ii) The condition that e ^ 0{n/k) (which in turn follows from the fact that we have s = n/(2e)). 
We show that both of these shortcomings are essentially unavoidable with our techniques. In 
particular, a lower bound on the 1"^ decision tree complexity of the threshold function from [5] 
implies that at least 0(e) invocations of the error detection routine are needed. Further, we show 
that for sublinear test complexity, the support size s must be in O(Mogn). This follows by 
interpreting the reduction as a set cover problem and observing that any set covers only a very 
small fraction of the universe. 

2 Preliminaries 

We begin with some notation. Given an integer m, we will use [m] to denote the set {1, . . . , m}. We 
will denote by Fg the finite field with q elements. An axb matrix M over ¥q will be called strongly 
explicit if given any (i, j) e [a] x [b] , the entry Mj j can be computed in space poly (log g+log a+log b) . 
Given a vector y € (C C S" resp.) and a subset S CI [n], we will use ys {Cs resp.) to denote y 
(vectors in C resp.) projected down to the indices in S. We will use wt(x) to denote the number 
of non-zero entries in x. Further, for S C [n], we will use WTs(x) to denote wt{xs). 

Codes. A code of dimension k and block length n over an alphabet S is a subset of of size 
The rate of such a code equals k/n. A code C over Fg is called a linear code if C is a linear 
subspace of F^ . If C is linear, then it can be described by its parity-check matrix H, i.e. for every 
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c C , H ■ (F = 0. An asymptotically good code has constant rate and constant relative distance 
(i.e. any two codewords differ in at least some fixed constant fraction of positions). 

Tolerant Testers. We begin with the central definition. Given a code C C S", reals ^ d < c ^ 1, 
^ ei < £2 ^ 1 and integers r = r(n) and s = s(n), an (r, s, ei, e2)c,d-tolerant tester T for C is a 
randomized algorithm with the following properties for any input y £ S": (1) If A{y,C) ^ ein, 
then T accepts with probability at least c; (2) If A{y, C) ^ e2?^, then 7~ accepts with probability at 
most d] (3) T makes at most r passes over y; and (4) T uses at most s space for its computation. 

Further, we will consider the following special cases of an (r, s, ei, e2)c,d-tolerant tester: (i) An 
(r, s, 0, e)c,(i-tolerant tester will be called an (r, s, e)c,d-local tester, (ii) An (r, s, 0, l/n)c,d-tolerant 
tester will be called an (r, s)c,d-error detector. There are some definitional issues that are resolved 
in Section m 

List Disjunct Matrices. We give a low-space algorithm that can compute a small set of possible 
defectives given an outcome vector which is generated by a list disjunct matrix. Relevant definitions 
and material related to the algorithm can be found in Section [H 

Some Explicit Families of Codes. We now mention two explicit families of codes that we will 
see later on in the paper. We first begin with the Reed-Solomon code. Given g ^ n ^ 1 and 
a subset S = {ai, . . . C Fg, the Reed-Solomon code with evaluation set S and dimension k, 
denoted by RS5[/i;], is defined as follows: Any message in F^' naturally defines a polynomial P{X) of 
degree at most k — 1 over F^. The codeword corresponding to the message is obtained by evaluating 
P{X) over all elements in S. It is known that a (n — A;) x n parity check matrix of RS5 is given by 
HkSs = {"^j ■ a)}'^Zo~^\ "=i, where 

1 

Another explicit code family we will consider are expander codes. These are binary codes whose 
parity check matrices are incidence matrices of constant-degree bipartite expanders. In particular, 
if we start with a strongly explicit expander, then the parity check matrix of the corresponding 
expander code will also be strongly explicit. 

3 Connections to Property Testing 
3.1 The basic connection 

We now highlight a simple connection between tolerant testers in the data stream world and tolerant 
testers in the query world: 

Proposition 1. Let C C [q\^ he such that it has a tolerant tester T with query complexity r, 
thresholds £1 and £2 and time complexity tq{r) (i.e. it makes tq[r) operations over [q\ for any 
possible query realization). Then there also exists an {r,0{tq{r) + r log n),£i, £2)0,3 -tolerant tester 
T' . Further, ifT is non-adaptive, then T' can be implemented as a (l,0(tg(r) + rlogn),£i, £2)0,3- 
tolerant tester. 

Proof. The claimed result follows from the obvious simulation. In general, the tester T' works 
as follows: T queries r positions in the input and then applies some function on the queried values 
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(in the case when T' is adaptive, it apphes possibly r different functions after each query). As the 
total time complexity of T is tq{r), the entire computation of T can be done in time (and hence, 
space) tq{r). If T is non-adaptive, all the query positions can be decided upfront and all the values 
can be determined in one pass. Otherwise the simulation might need r passes in the worst case. 
We might need to use an additional 0(r log n) space to store the indices of the query positions, 
which implies that T' has the claimed properties. I 

Remark 1. In general, one cannot say much about tq{r) other than bounding it by 2'^^'^'^^ as 
the definition of the usual tolerant tester does not put any computational efficient constraints on 
the testers. However, if the tolerant tester T makes a constant number of queries then its time 
complexity is also a constant number of operations over the alphabet. 

3.2 Tolerant Testing of Reed-Muller Codes 

Let RM((7,^, m) denote the Reed-Muller code obtained by evaluating m-variate polynomials over 
Fq of total degree i < q. These codes are known to have block length n = g"*, dimension ("^^) and 
distance (1 — i/q)n (cf. |24l Lect. 4]). Note that this implies that if m is a constant and i = i^{q), 
then IlM{q,i,m) is asymptotically good. These codes are known to be tolerant testable in the 
property testing world with polynomial number of queries. 

Theorem 2 (|16]). Let m,i,q ^ 1 be integer such that i < c - q for some universal constant c. 
Then there exists a tolerant tester for 'RM{q,i,m) in the property testing world that can distinguish 
between at most sin and at least £2n errors with q = n-^/*" queries, where £i and £2 are absolute 
constants that only depends on c. 

In fact, the test is simple to describe: pick a random line in F^m and check if the projected 
down received word is within some threshold Hamming distance from the corresponding RS code 
of dimension H. and block length q. This latter step can be solved using the fast list decoding 
algorithm for RS codes from [2] in time O(glog^g) (and hence, in the same amount of space). 
Thus, Proposition [1] implies the existence of an asymptotically good code that can be tolerant 
tested by a one-pass, 0{rf) space (for any e > 0) data stream algorithm. (Note that the code has 
an absolute constant as its relative distance but its rate is exponentially small in 1/e.) 

4 Some definitional issues 

One decision that we need to make is how we count the space/time requirement for our algorithms. 
In particular, given a code defined over S, do we do our accounting in terms of number of operations 
over S or the number of operations over "bits"? This question is moot when S has constant size 
as both the measures will be within constant factors of each other. However, if can depend on 
n, which will be the case in some of the codes that we consider in this paper, the two measures will 
not be within constant factors anymore. In particular, for arbitrary S, an operation over S may 
take Vl{n) space and time, which will be prohibitive for our purposes. 

We resolve the question above in the following way: First, we will account for the complexity 
measures in terms of the number of operations in S. Further, for positive results, we will focus on 
the case where S = Fg with q ^ rP^'^\ Note that in this case, all operations (including addition, 
multiplication and exponentiation) can be done in poly(logn) time and O(logn) space. Finally, for 
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general fields Fg, the algorithm will also need access to an irreducible polynomial (of degree at most 
O(logn)). However, note that the definition of a code provides the definition of its alphabet. We 
will assume that the algorithm has full prior knowledge about the code (including e.g. the value 
of n). Thus, we will assume that the irreducible polynomial will be (implicitly) a part of the input 
to the algorithm. For certain cases, when the irreducible polynomial is part of an explicit family, 
the algorithms can compute these irreducible polynomials "on the fly" and thus, do not need to be 
part of the input. 

Finally, by definition, the block length of a code is fixed. However, for a meaningful asymptotic 
analysis, we need to think of an increasing sequence of block lengths. Thus, from now on when we 
talk about a code, we implicitly mean a family of codes. 

5 Data Stream Algorithms for Error-Detection 

A positive result. We first show that any linear code with a strongly explicit parity check matrix 
has an efficient 1-pass data stream error detector. 

Note that for a linear code C C with parity check matrix H, the error detection problem with 
the usual polynomial time complexity setting is trivial. This is because by the definition of parity 
check matrix for any y S Fg , y € C if and only \i H ■ = Q. However, the naive implementation 
requires Vt[n) space which is prohibitive for data stream algorithms. We will show later that for 
deterministic data stream algorithms with a constant number of passes, this space requirement is 
unavoidable for asymptotically good codes. 

However, the story is completely different for randomized algorithms. If we are given the 
syndrome s = Hjf" instead of y as the input, then we just have to solve the set equality problem 
which has a very well-known one-pass 0(log n)-space data stream algorithm based on the finger- 
printing method. Because s is a fixed linear combination of y (as H is known), we can use the 
fingerprinting technique in our case. Further, unlike the usual fingerprinting method, which requires 
any large enough field, we need to use an extension field of ¥q. For this, we need to get our hands 
on irreducible polynomials over Fg. 

5.1 Families of Irreducible Polynomials 

In our error detection algorithm we need low space construction of families of irreducible polynomi- 
als. Since our final algorithm will be randomized, a randomized algorithm to construct irreducible 
polynomials works. The following result is well-known (cf. [221 Chap. 20]: 

Theorem 3. Let q be a prime power, d he an integer and Q < 5 < 1 he a real numher. Then 
there exists a randomized algorithm that outputs an irreducihle polynomial of degree d over Fg with 
prohahility at least 1 — 5. Further, this algorithm makes 0((i^ log(l/(5) log g) operations overWq and 
needs 0(log(l/5) + dXogq) hits of space. 

Coming up with a deterministic polynomial algorithm for construction of irreducible polyno- 
mials is an open question. However, it turns out that in our application, we would be happy if 
the final irreducible polynomial has degree d' such that d' ^ d and is not much larger than d. In 
particular for prime p, there exists a deterministic algorithm that runs in time (and hence, space) 
poly((ilogp) and outputs a polynomial with degree at least d and at most 0{dlogp) [1]. 
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Next, we show that constructing such irreducible polynomials can also be done for fields of 
characteristic 2. The result follows from other known results. 

Theorem 4. Let q he a power of 2 and let d ^ 1 be an integer. Given the irreducible polynomial 
that generates ¥q, there exists a deterministic 0{dlogq) space, 0{{d'^ + log q)log^ q) -time algorithm 
that computes an irreducible polynomial over ¥q with degree d! such that d ^ d' ^ 2d. 

Coming up with an analogous result to Theorem d] for odd characteristic seems to be an open 
problem. 

We begin the proof of Theorem HI We will use the following result: 

Theorem 5 (cf. [9j). Let m ^ 1 be an integer and let (3 G such that Tr{f3) 7^ 0, where 
Tr{x) = X^^o^ trace function. Define the polynomials Ak{X) and Bk{X) recursively as 

follows (for k 0): 

Ao{X)= X 
Bo{X)= 1 
Ak+i{X)= Ak{X)Bk{X) 
Bk+i= Al{X) + Bl{X). 

Then Ak{X) + (3 ■ Bk{X) is an irreducible polynomial over ¥2^ of degree 2^'. 

To begin with, let us assume we can get our hands on a /3 as required in Theorem [3 Given 
such a /?, the rest of the proof is simple. Pick k to be the smallest integer such that d' = 2^ ^ d. 
It is easy to check that d ^ d' ^ 2d as required. To compute the final irreducible polynomial, we 
will need to do k iterations to compute Ai{X) and Bi(X) (for 1 ^ i ^ k). It is easy to check that 
each iteration requires 0(2Mogg) space (to store the intermediate polynomials) and 0(2^Mog^g) 
time (to compute the product of two polynomials of degree at most 2*). To complete the proof of 
Theorem m we show how to efficiently compute an appropriate (3. 

We claim that /3 can be chosen to be a* for some ^ z ^ m — 1, where we use {1, a, . . . , 
as the standard basis for ¥2^ , for some root a of the irreducible polynomial that generates ¥2m . □ 
To see why this is true, assume for the sake of contradiction that Tr{a^) = for every ^ i ^ m — 1. 
Then as every 7 S ¥2^. can be written as a linear combination of 1, a, ... , a"*"^, Tr{'y) = (this 
follows from the well-known fact that Tr (71 + 72) = Tr (71) + Tr (72)). This implies that Tr{X) 
has 2"^ roots, which is a contradiction as Tr{X) is a non-zero polynomial of degree 2™~^. Finally, 
the correct choice of /3 = a* can be determined by going through all ^ i ^ m — 1 and evaluating 
rr(a*) (which can be done in O(log'^g) time). 

We now state our result. 

Theorem 6. Let C C 6e a linear code of dimension k and block length n with parity check 

matrix H = {/i't,j}"=o^~^'"=x. Further, assume that any entry hij can be computed in space S{n,q), 
for some function S. Given an a ^ 1, there exists a (1, 0{S{n, q) +a log n))i ^-a-error detector for 
C. 

^ Note that if ?n is odd, then just /3 = 1 suffices. 
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5.2 Proof of Theorem [6] 

Let y = {yi, . . . , yn) G be the received word and let s = (sq, • • • , Sn-k-i) = Hy^ G F^"*"'. It is 
easy to check that for every ^ i < n — k, 

n 

Si = ^yjhi,j. (1) 

i=i 

Further define 

n—k—l 

S{X)= Si^'- 

Note that our task is to verify whether S{X) is the ah zeros polynomial. Towards this end, we will 
use the fingerprinting technique. 

Let Q = q'^ for some d to be chosen later such that 7i^~^°' ^ Q ^ qv}^"' . The algorithm is simple: 
pick a random /3 E Fq and verify if S{f5) = X^^^q = 0. By ([1]), this is the same as checking if 

Ylii=o (X]j=i yj^ij) /5* = 0- Note that this is possible as Fq is an extension field of Fg and thus, 
all the terms in the sum belong to Fg. Thus, by changing the order of sums, we need to verify if 

= 0. (2) 

It is easy to verify that the above sum can be computed in one pass as long as the quantity 
'n^=o~^ P^^iJ '^^^ be computed efficiently "on the fly." The latter is possible as we know /3 and we 
can compute any entry hij on the fly. If S{X) is the all zeros polynomial, then the check will always 
pass. If on the other hand, S{X) is a non-zero polynomial of degree at most n, then S{f3) = for 
at most n values /3 S ¥q. Thus, the probability of the check passing is at most n/Q which by our 
choice of Q is at most 1 /n°' . 

To complete the proof we need to analyze the space requirement of the algorithm above. First we 
note that by Theorem [3] we can compute an irreducible polynomial over ¥q of degree d = 2|^|^ . 

Note that n^+"' ^ Q ^ qn^"*""^ as claimed before. Also note that any operation in ¥q can be carried 
out by storing 0{d) elements from Fg. This implies that the sum in ([2]) can be computed with 
space 0{S{n, q) + d) = 0{S{n, q) + alogn). 

An inspection of the proof above shows that the time complexity of the error detector is domi- 
nated by the number of ¥g operations needed to compute Y17=o~^ P^'hij. 

For expander codes, this time complexity is just a constant number of Fq operations (and 
hence 0(logn/ log (7) operations in Fg). For RS codes, recall that we have hij = Vj ■ a*-, where 
Vn = Ti ^—7 T- If, say, for some fixed B G F* Vj = /3 for every 1 ^ j ^ n, then one can 

essentially ignore vj and one only needs to compute J27=o~^ I^^Oi)^ which is just ^ unless 

/3 = (in which case the sum is 0) or /3 = (aj)""*^ (in which case the sum is just (n — k) modulo the 
characteristic of Fg). The latter condition can be verified with poly (log n/ log g) operations in Fg. 

In general RS, any hij can be computed with 0{n) operations in Fg. Thus, the sum can be 
computed in time O(n^). 

Thus, we have argued that 



10 



Corollary 7. Let q be a prime power and define S = {ai, . . . , a„}. Then there exists an (1, 0(log ^^)l,l/2- 
error detector for RSs that runs in time 0{n?). Further, if there exists a /? G F* such that for every 
^ ^ j ^ n, Y\i<£<n e^ji'^j ~ ^) — then the algorithm can he implemented in 0{n) time. 

It is easy to check that S{q,n) is O(logn) for (strongly exphcit) expander codes and RS (and 
GRS) codes. This imphes the foUowing: 

Corollary 8. Let n ^ 1. Then for q = 2 and n ^ q ^ poly(n), there exists an asymptotically good 
code C C F^ that has a {1, O (log n)) 11/2- error detector. 

A negative result. We show that randomness is necessary even for local testing. In particular, 
we show the following: 

Theorem 9. Let C C [g]" be a code of rate R and relative distance 5 and let ^ e ^ 5^/8 be a 
real number. Then any {r, s,£)ifl -local tester for C needs to satisfy r ■ s ^ 

5.3 Proof of Theorem H 

We will be using communication complexity to prove Theorem [9l 

The proof uses he standard fooling set technique, however, unlike the usual application of two- 
party communication complexity in data stream algorithms, where the stream is broken up into 
two fixed portions, in our case we need to be careful about how we divide up the input. To see 
the necessity of this, consider the code C x C C S^" and say we break the received word y in the 
middle and assign the first half (call it yi) to Alice and the second half to Bob. In this case there 
is a simple O(logn) protocol- Alice simply sends the distance of yi to the closest codeword in C 
to Bob- to compute the distance of y to the closest codeword in C x C exactly. However, we can 
show that for every asymptotically good code, there is some way of breaking up the input into two 
parts such that there exists an exponentially sized fooling set. 

To further explain, we do a quick recap of some of the basic concepts in communication com- 
plexity and refer the reader to source material for more details |19j . 

Let g : {0, 1}"^ x {0, 1}"^ — )• {0, 1} be a function. Further assume Alice has x £ {0, 1}^^ and 
Bob has y G {0,1}"^. The (deterministic) communication complexity of g, denoted by CC{g), is 
the minimum number of bits that Alice and Bob must exchange in order to determine g{x, y) in 
the worst case. The following observation is a standard technique to obtain lower bounds for data 
stream algorithms: 

Proposition 10. Let A be an r-pass, s-space deterministic data stream algorithm that decides g. 
Then r-s^ CC{g). 

Next we consider the following technique for lower bounding the communication complexity of 
a function. A subset F C {0, Ij^^ x {0, l}"'^ is called a fooling set for g if (i) For every (x, y) G F, 
g{x, y) = b for some fixed b G {0, 1} and (ii) For every (xi, yi) 7^ (x2, 2/2) G either g{xi,y2) = 1 — 6 
or g{x2,yi) = 1 — 6. The following result is well-known: 

Proposition 11 (cf. [H]). Let g : {0, l}"i x {0, l}"^ {0, 1} and F be a fooling set for g. Then 
CC(5) ^log(|F|). 
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Finally, we will consider boolean functions with one input and we define their communication 
complexity as follows: Let / : {0,1}" — )• {0,1}. Further, for any ^ ni,n2 ^ n such that 
ni + n2 = n, define fni,n2 '■ {^i 1}"^ ^ {0) 1}"^ ~^ {0; 1} by naturally "dividing" up the n-bit input 
for / into the two required inputs for fni,n2- The communication complexity of / is then defined 
as follows: 

CC(/)= max CC 

n\+n2=n 

We are now ready to prove Theorem [9l We will do so by proving that the deterministic 
communication complexity of the following function is large. Define fc '■ {0, 1}" — t- {0, 1} such that 
fc{y) = ^ y ^ C; fc{y) = if A(y, C) ^ (5^n/8; otherwise fciy) can take an arbitrary value. We 
will show that: 

Lemma 12. fc has a fooling set of size at least q^^^/^. 

Note that as we are measuring space in terms of the number of elements from [q], Lemma [12] 
and Proposition 1111 imply Theorem [H 

In the rest of the section, we prove Lemma [T2l For notational convenience, define k = Rn, 
d = 6n, a = J and /3 = |. Thus we need to exhibit a fooling set of size at least q^'^. 

Our fooling set will be a subset F ^ C with an < ni < n such that F^n{\ ^i^'i ^[n]\[m] have 
distance at least ad/2. Further for any c£T and £ C \ F, A(c, c?) ^ ad/2. We claim that such 
an F is indeed a fooling set. To see this consider ^ (? £ F ^ where = (c},C2), = (cf,c|), 
c\ € and C2 € By definition, fc{(^) = fc{(?) = 1- Next we show that either 

fciyi) = or fc{y2) = 0, where yi = (c}, c^) and y2 = (cf , c^). For any c £ C \ F, this is true by 
definition of F. For any c £ F, this is true by the distance properties of Fj^^] and F[n]\[ni]- 

We will construct the fooling set F in a greedy fashion. We begin with the case when n < 
2(1 — a)d. We claim that in this case F = C and ni = \n/2\ works. Note that both ni and n — ni 
are both at most (1 — a)d. Since C has distance d, this implies that both and have 
distance at least d — {1 — a)d = ad. This completes the proof for the base case. 

For the general n ^ 2(1 — a)d case, we reduce it to the base case. In particular, we present 
a greedy iterative process, where at the end of the i^^ step, we have a subset Fi C C, with the 
property that for every c £ Fi and £ C \ Fi, A(c, cf) ^ ad/2. We of course start with Fq = C. 
It will turn out that we will run this process for r ^ times and F = Fy. For ease of exposition, 
we will also track variables rrii and di such that mg = n and do = d. Next, we mention the 
invariance that we will maintain with the iterative algorithm. First, it will always be the case that 
mj+i = mj — (1 — a)di and dj+i ^ di — adi. Think of rrii as the block length of the (projected 
down) Fi and di as the corresponding distance. 

Assume that we have our hands on Fi. If rrij < 2(1 — a)dj, then we are in the base case and the 
process terminates. (In this case r = i and ni = [mr/2\ .) If not, then Fi projected onto the first rrii 
positions (call this projected down code Gi) has distance at least di. Group codewords in Fi such 
that in each cluster the codewords in Gi projected down to the last (1 — a)di positions differ from 
each other in < adi positions. If the number of clusters is at least q^^, then let i^j+i be defined by 
picking one codeword from each of the clusters and the process terminates. (In this case r = i + 1 
and ni = rrir.) If not, then define Fj+i to be the largest cluster and define mj+i = mj — (1 — a)di. 
We then continue the process for i + 1. 

For the time being, assume that the following are true: (i) The process stops at iteration r such 
that r ^ (ii) Gi has distance di such that di ^ {I — ia)d; and (iii) Every codeword in Fi differs 
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from every codeword in -Fj_i \ Fi in at least ad/2 positions. Assuming these three properties, we 
argue that indeed has the required properties. 

Assume that the process terminates when the base case is reached. Let G = Gr- Then by the 
argument in the base case, we have that both G^^i] and G[mr]\[ni] (and hence, F^^^] and F^n]\[ni]) 
have distance at least adr ^ ad/2, where the inequality follows from properties (i) and (ii). Further, 
by property (iii), it is the case that for every c G F and c' G C \ F, A(c, (?) ^ ad/2. Finally, note 
that when we pick a single cluster, we have \Fi+i\ ^ \Fi\/q'^''. Thus, if we terminate with the base 
case, we have 

where the last inequality follows from the following argument for the inequality 1 — r/3 ^ /?. This 
inequality is satisfied if 

r<i-l. 

Now by property (i), r ^ ^ = | = |. Now as /? ^ 5/6 1/6, we have /3 1/(3/3) ^ 1//3 - 1, as 
desired. 

Now we consider the case when the process terminates before reaching the base case. In this 
case because of the termination condition, we have that Gr-i projected onto the last (1 — a)dr-i 
positions (and hence, F[n]\[ni]) has distance at least adj—i ^ ad/2, where the inequality follows 
from properties (i) and (ii). Also F^^i] has distance at least d^-i ~ (1 ~ a)dr~i = ad^^i ^ ad/2. 
Further, by property (iii), it is the case that for every c G F and c' £ G \ F, A(c, c') ^ ad/2. 
Finally, by the termination condition, we have \F\ ^ q^^, as desired. 

Thus, we are done with the proof modulo showing that properties (i)-(iii) hold, which is what 
we do next. We begin with property (i). Note that if we do not terminate in the middle, then we 
have di+i ^ (1 — a)di ^ (1 — aYd. Now note that 



r-l 

rrir = n — 



X-n w <- ^ (i-a)(i-(i-a)Od 

> (1 — a)di ^ n — d y [1 — a) = n . 

^-^ ^-^ a 

1=0 1=1 

Since mr ^ 0, we have 

1 - 1 - ar ^ = ^ 1 - exp -1/2 , 

(1 — a)d [1 — a)o 

where the last inequality follows from the fact that a = 5/4 ^ 1/4. Thus the above implies that 

(l-a)'^^exp(-l/2), 

which in turn implies 

. f ^ \ 1 

r In ^ — . 

VI - ay 2 

Using the fact that ln(l — x) = —{x + x'^/2 + x^/3 + . . . ) for |x| < 1, we get that the above implies 

r(a + aV2 + aV3 + ...) ^ ^, 
which in turn implies that ra ^ i, as desired. 
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We now move to property (ii). As we saw earlier, we have ^ (1 — aYd for i ^ r. Now as 
ar ^ 1/2 (and hence ai ^ 1/2), we have that (1 — a)* ^ 1 — ia, which proves property (ii). Note 
that this also implies that di ^ d/2. Finally, for property (iii), note that by construction, if we 
do not terminate in middle at step i, every codeword in -Fj+i differs from Fi \ -Fi+i in at least adi 
positions. Since, di ^ d/2, property (iii) follows. The proof is complete. 

Error detection of a projected down code. We will be dealing with RSs'[/c] with S = 
{ai,...,an}. In particular, we are interested in a one-pass, low space data stream algorithm 
to solve the following problem: The input is y € followed by a subset E <^ S with \E\ = e. We 
need to figure out if ys\E S RS5y£;[A;]. We have the following result: 

Lemma 13. Let e,n,k ^ 1 be integers such that k + e ^ n. Then the problem above can be solved 
by a one-pass, 0{e + alogn) space data stream algorithm with probability at least 1 — n~", for any 

5.4 Proof of Lemma 1131 

Consider the degree e polynomial Pe{X) = Yli^Ei-^ ~ "^O- Further, consider a new received word 
z = (zi, . . . , Zn) where Zi = yi ■ PE{oti). The algorithm to solve the problem above just checks to 
see if i*e RSs'[e + k]. 

We begin with the correctness of the algorithm above. If y[n]\E ^ RSg\^EW\, that is, y[n]\E is 
the evaluation of a polynomial f{X) of degree at most k — 1 over S\E, then z is the evaluation of 
f{X) ■ Pe{X) over S. In other words, z G RSs[k + e]. 

Now it turns out that the other direction is also true. That is, if i* G RSs[k + e] then y[n]\E £ 
RS[„]\£;[A;]. Note that z is the evaluation of a degree at most e -\- k — 1 polynomial g{X) over 
S, where g{X) = Pe{X) ■ h{X), where h{X) has degree at most k — 1. This is easy to see: by 
definition PE{X)\g{X) and the degree requirement on g{X) implies that h{X) has degree at most 
— 1. Finally, as PE{oii) ior i ^ E, this implies that h{ai) = yi for i ^ E. In other words, 

We conclude this proof by showing how to deal with the unknown E using a data stream 
algorithm. Since E is unknown, let us denote Pe{X) = X^ + Yli=oPi-^^j where {pi} are the 
unknown coefficients. Recall that in our error detection algorithm to check if z G ^^^^[e + k] we 
need to check if the following sum is 0: 

n /n—k—e \ 

3=1 \ j=0 / 

where /3 is a random element in a large enough extension field of Fg and is the parity check 

matrix of RSs[k]. Note that as Pe{X) = X^ + Yli-o Pi-^^ ^ sum above can be written as 
Qe + Y!,b=oPbQb, where (for ^ 6 ^ e) 

n /n—k—e \ 

j=l \ i=0 / 

Note that each of the Qh sums can be computed in one pass and low space without the knowledge 
of 
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Now the algorithm to check if z G RSs[k + e] is clear: maintain the e + 1 sums Qb- At the 
end of the pass, the previous algorithm knows the set E CI [n]. Given this, we can compute the 
coefficients {e6}^~Q. Then we declare y[n]\E S RSj^j^^; if and only if Qt + X^fe=o e^Qfe = 0- This 
extra computation will need storage for 0(e) elements in the extension field of F^. 

6 Tolerant testing 

In this section we assume that we are working with RS5[A;], where S = {ai, . . . , On} ^ Fg- (However, 
our results will also hold for closely related codes such as the folded RS code [17].) 

As was mentioned in the Introduction, there is a trivial reduction from one tolerant testing 
instance (say where we are interested in at most e vs. > e errors) to (") instances of error detection: 
for each of the (") potential error locations, project the received word outside of those indices and 
check to see if it's a codeword in the corresponding RS code via the algorithm in Theorem [H Using 
Theorem [6] (with a = 0(e)), we can implement this as an (1, 0(e log n), e/n, (e + l)/n)-|^ „_n(e)- 

tolerant tester. Unfortunately, this algorithm uses ^^^'^^ time. Next, we show how to obtain 
roughly the same space complexity but with a much better time complexity. 

Theorem 14. Let e,k,n ^ 1 be integers such that k ^ n and e ^ n — k. Then 

(a) If e{e + k) ^ 0{n), then there exists a {l,0{elog^ n),e/n,2e/n)i^-n{i)-tolerant tester for 
RSs[A;] under worst-case errors. 

(h) If ek ^ 0{n), then there exists a {l,0{elog^ n),e/n, 2e/ n)-^^ ^-n{i) -tolerant tester for RS s[k] 
under random errors. 

(c) If e ^ 0{ ^+{/ sn/k), then there exists a {l,0{e'^log^ n),e/n,5e/n)-^^^-n(i)-tolerant tester for 
the folded RS code with folding parameter s under worst-case errors. 

Further, all the algorithms can be implemented in 0{n'^) time. 

In the above, the soundness parameter follows by picking a to be large enough while using 
Theorem [6l We observe that a naive implementation achieves the 0{n'^) runtime. We also show 
that for part (a) and (b) by replacing n by re/ log n in the RHS of the upper bound on k and bumping 
up the space to 0{e^ log^ n), the algorithms for RSp^ffe] can be implemented in 0{ne) time. In fact, 
along with the faster running time, we get (1, 0(elog^ re), e/re, (e + l)/re)]^ „-n{i)-tolerant testers. 

We start with some notation. Given an t x n (list) disjunct matrix M let s and s' denote the 
minimum and maximum Hamming weight of any row in M. Further, let D{N) denote the runtime 
of error detector for RSqj^...^qj^ [A;]. 

We begin by analyzing the runtime of the tolerant tester from part (a) of Theorem [TH The 
runtime has two parts: one is the time taken to run the error detector for all the projected down 
codes, which are determined by the rows of the (e, e)-list disjunct matrix M. Note that this step 
takes time at most t-D{s'). The second part is the time taken to run the algorithm from Lemma [T3| 
which can be verified to be D{n). Thus, the overall running time is 

t- D{s') + D{n). (3) 

It can be verified that for list disjunct matrices from Section [HI t = O(elogn) and both s and s' 
are 0(n/e). Further, by Corollary [71 we upper bound D{N) by d{N'^). Thus, ^ implies that the 
runtime is upper bounded by 0{n'^). 
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Next, we look at part (a) when S = ¥q. In this case we will pick an explicit disjunct matrix. This 
classic matrix is defined by associating the columns with the codewords of RSf^, [k'] for appropriate 

choices of q' and k'. (Note that we then have q = n = {q')^ .) The columns of the matrix are 
the corresponding RS codewords, where each symbol from Fg/ in the codeword is replaced by the 
binary vector from {0, 1}*^ , which has a 1 only in the position corresponding to the symbol (when 
thought of as an element from \q'])- It is well known that if one picks k' = g'/e, then the matrix 
is e-disjunct and t = O(e^log^n) and s = s' = njy/t [13]. Note that one can index the rows of 
this matrix by the tuples (a, 6) G (Fg/)^. For the rest of the argument fix such a row (a, 6). The 
columns that participate in this row correspond to the messages (jtiq, . . . ,mfc/_i) G F^, such that 

^*Lq^ mjO* = h. Call these set of vectors Si,. Before we proceed we recall that since q = {q')'^\ 
there is an isomorphism between Fg and F^, . Now note that Sh is a linear subspace and thus, 
for any 7 G Si, W^eSb c^^'y^^ ~ ^) (where we think of the operations as happening over Fg) is just 
the product of non-zero vectors in Si, which is some fixed constant (say) /? G F*. Now note that 
the error detection corresponding to row (a, 6) is for the projected down code RSs'JA;]. Thus, we 
now satisfy the second condition in Corollary [71 which implies that we can assume that the error 
detection can be done in linear time. Finally, for RSp^ [k] it is well known that the second condition 
in Corollary [7] is also satisfied. Thus the overall runtime is bounded by 0{t ■ n/y/i + n), which is 
0{ne), as desired. 

(We remark that if we can get the best of both the random list disjunct matrix construction, 
i.e. t = O(elogn) and both s,s' in 0(n/e), and the explicit RS code based disjunct matrix, i.e. 
the second condition of Corollary [7] is true, then we can have a tolerant tester with the optimal 
runtime of 0(n).) 

The proof for the runtime for part (b) in Theorem [T3] is identical and is omitted. The proof 
for the naive implementation runtime for part (c) is similar to part (a)- everything gets multiplied 
by O(loge) factor, which is at most an extra log factor. We do not know of an explicit (list) 
disjunct matrix that satisfies the extra requirements for part (c) and thus, we do not have any 
implementation with runtime better than 0(?i^). 

For the rest of the section, we will focus on the other parameters of the algorithms. 

All of the results above follow from a generic reduction that uses group testing. In particular, 
let y be the received word that we wish to test, and cbe the nearest codeword to y. Let x G {0, 1}" 
be the characteristic vector associated with error locations in y with respect to c. The high level 
idea is essentially to figure out x using group testing. 

Let M be a i X n binary matrix that is (e,e)-list disjunct. By Section [8] we can get our hands 
on M with t = O(elogn) with O(elog^n) space. Now consider the following natural algorithm. 

For all i G [t], check if ^a/. G RSMi [k] , where Mj is the subset corresponding to the zth 
row of M. If so, set rj = 0, else set rj = 1. Run A from Proposition 1171 with r as input, 
to get x. (Step 1) If wt(x) ^ 2e, declare that ^ 2e errors have occurred. (Step 2) If 
not, declare ^ e errors iff ys\T G R'S5'\j'[A;], where T is the subset corresponding to x. 

The way the algorithm is stated above, it seems to require two passes. However, using Lemma [T3| 
we can run Step 2 in parallel with the rest of the algorithm, resulting in a one-pass implementation. 

Let z be the result of applying M on x. Now if it is the case that Zi = 1 iff = 1, then 
the correctness of the algorithm above follows from the fact that M is (e, e)-list disjunct and 
Proposition [TTl (If wt(x) ^ e, then we have S^ C Sg (where 5*^ is the subset of [n] whose incidence 
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vector is x) and wt{x) < 2e, in which case the algorithm will declare at most e errors. Otherwise, 
the algorithm will "catch" the at least e errors in either Step 1 or failing which, in Step 2.) 

However, what complicates the analysis is the fact that even though Zi = implies rj = 0, the 
other direction is not true. In particular, we could have ypii S RSjv/J^]) even though {y — c)Mi / 0. 
The three parts of Theorem [T3] follow from different ways of resolving this problem. 

Note that if wt(x) ^ 2e, then we will always catch it in Step 2 in the worst-case. So from now 
on, we will assume that < wt(x) ^ e. Let the minimum support of any row in M be s. 

We begin with part (a). Let s > k + e and define A = y — c. Note that we are in the case where 
< wt(A) ^ e. Since s ^ k and ca/,. G RSMi[k], yhu G RSMiW\ if and only if Aa/. G RSMi[k]. 
Note also that for any i, WT(AMi) ^wt(A) ^ e. Now, the distance of RSMi[k] is s — A; — 1 > e, so 
for every i with non-zero A^/i, A^/- RSmJ^]; which in turn means that Zi = l will always imply 
that Tj = 1 when M has the stated support. By Section [HI we have s ^ n/(2e), which concludes 
the proof of part (a). 

The following lemma follows from the random errors result in |21j and is needed for part (b): 

Lemma 15 ([21]). Let k ^ n < q be integers such that q > (f Then the following property 
holds for RS codes of dimension k and block length n over ¥q : For ^ 1 — q~^^^'^ fraction of error 
patterns e with WT(e) ^ n — 4:k and any codeword c, the only codeword that agrees in at least Ak 
positions with c + e is c. 

Now if s ^ 4:k, then with high probability, every non-zero Aa/. RSj\/. [k] (where A is as defined 
in the proof of part (a)). The fact that s ^ n/(2e) completes the proof of part (b). 

The proof of part (c) is more involved and needs a strong connection to the list decodability of 
the code being tested, which we discuss next. 

Connection to List Decoding. Unlike the proofs of part (a) and (b) where the plain vanilla 
(e, e)-list disjunct matrix works, for part (c), we need and use a stronger notion of list disjunct 
matrices. We show that if the list disjunct matrix is picked at random, the bad tests (i.e. = 
even though Zi = 1) do not happen often and thus, one can decode the result vector even with these 
errors. We show that these kind of matrices suffice as long as the code being tested has good enough 
list decodability. The tolerant testing algorithm for a Reed-Solomon code, for instance, recursively 
reduces the amount of errors that need to be detected, and after application of Lemma [T3l can be 
made to accomplish this in a single pass. We also show that the relevant list disjunct matrices can 
be found, with high probability, using low space and a low number of random bits. 

We need to show what the forbidden subsets will be in our setting. Let C be the code we are 
trying to test. Let y be the received word and let c G C be such that A{y,c} ^ e ^ d/2, where 
d is the distance of C. Let C be {n — a, L + l)-list decodable, that is, for any Hamming ball of 
radius at most n — a, there are at most L + 1 codewords from C in it. Note that if n — a ^ A{y, c), 
then there are at most L codewords (other than c) that agree with y in at least a positions. Also 
note that each such codeword agrees with y in at most n — d/2 positions. Let T C [n] be the set 
of positions where y and c agree. Then define J^a,n-d/2(T) to be the (at least a) positions where 
codewords other than c agree with y. As C is (n — a, L + l)-list decodable, \ J^a,n-d/2(T)\ ^ L. 

We now show how one can use list disjunct matrices from Definition [1] to construct data stream 
algorithms for tolerant testing of an RS code C. Assume that there exists a ^ 7 ^ 1, such that 
for every (large enough) / ^ 1, there exists a strongly explicit (/, /, 7, -/^a,n-d/2)"list disjunct matrix 
Mf. We next show how these matrices can be used to solve the tolerant testing problem for C 
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where we want to distinguish between the case that at most e errors have occurred and at least 
2ej5^ + 1 errors have occurred. 

First consider the case when e ^ k. In this case we can use the algorithm from part (a) of 
Theorem I14[ Thus, if e ^ A;, then we can handle e vs. 2e errors, where e ^ n/(2k), in space 
O(elog^n). Now consider the case when e > k. Let us use the matrix Mg on y as we did before 
That is, for every row of Mg (in particular, the corresponding subsets S C [n]), check if ys € RSs[k]. 
If so, assign rj = (otherwise assign = 1). Given the result vector r, run A from Proposition [T71 
on it to obtain a subset G C [n] such that |G| ^ 2e and [n] \ G contains at most 76 errors. 
Thus, we have reduced the problem from at most e errors out of n positions to the problem of 
at most 76 errors in at least n — 2e positions. Then, the rest is natural: recurse on this idea. 
We stop when we are left with at most k errors. Note that we will need 0(log e/ log(l/7)) many 
recursions. Because of these recursions, we can handle the case when there are at most e errors 
vs at least 2e(l + 7 + 27^ + ...) + 2k errors. This implies (as e > A;) we can definitely handle e 
vs. 26^5^ + 1 errors. The way the idea is stated above, it seems like an 0(log e/ log(l/7))-pass 
algorithm. However, using Lemma [T3l the algorithm outlined above can be implemented in one 
pass. 

We are all done except the construction of the list disjunct matrices as defined in Definition [1] 

Theorem 16. Let e, n,d,a,L ^ 1 be integers such that e ^ O (^jj^^x) ■ There exists a large enough 

constant c > 1 such that ifc-elogn ^ n, then the following holds: There exists a (e, e, ^,J'a,n-d/2)- 
list disjunct matrix with t = celog n rows (where for every E CI [n] with\E\ ^ e, \J-'a,n-d.2iE)\ ^ L). 
Further, every row has at least ^ ones in it. In addition, one can construct such matrices with 
probability at least 1 — n~^^^^ using R = 0{t{e + logL) • loge • logn) random bits. Further, given 
these R bits, any entry of the matrix can be computed in poly(logn) space. 

A folded RS code (with "folding parameter" s), is (^it- ~ 

n,n^^' j -list decodable |17] . 

Thus, Theorem [TU] proves part (c) of Theorem UM (Note that Theorem [TU] also has the constraint 
that e ^ 0{d/ log L). However since logL is O(logn) above and as long as d = 0(n), this bound is 
much weaker.) 

We prove the existence of the required object by the probabilistic method. In fact this proves 
the second part but with R = 0{ntloge). To reduce the randomness, we observe that the proof 
only requires bits that come from an 0{t{e + logL) loge)-wise independent source. 

Let M be a t X n matrix, where each entry is chosen to be one with probability 1/e and 
t = c ■ e log n for some large enough constant cH 

We first argue about the minimum support size of any row in M. It is easy to check that the 
expected Hamming weight of any row in M is exactly n/e. Thus, by the Chernoff bound, the 
probability that any row has Hamming weight at most n/2e is upper bounded by 

exp (-^) ^ exp(-clogn/12) n'^^^ (4) 

where the last inequality follows for large enough c. Now by the union bound (and the fact that 
t ^ n), all the rows have Hamming weight at least n/{2e) with probability at least 1 — n~^^^ . 

Next we move on to proving property (a) from Definition [1] for M with 61 = To this end, 
fix subsets U,T CI [n] with \U\ = e, |r| = e and J7 n T = 0. Call a row j G [t] good if there exists a 

''in this proof, we have not attempted to optimize the constants. By a conservative estimate, picking c = 10^ 
would sufHce. 
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i E U such that Mj^i = 1 and Mj^£ = for every i €z T. Now the probabihty that a row is good is 
exactly 



/ 1 



1-1-7 1-7 (5) 



e 

where the inequality follows if e ^ 2 and the fact that (1 — 1/x)^' ^ exp(— 1) ^ 1/2. Thus, the 
expected number of good rows is at least t/8. By the Chernoff bound, the probability that the 
number of good rows is at most t/16 is upper bounded by 

celogn^ ^ ^__^90g 



where the inequality follows for large enough c. Thus, with high probability, the number of good 
rows is at least t/16. Then by the pigeonhole principle, at least one column i £ U is contained in 

at least 62 good rows. Taking the union bound over the (") {^~^) choices of T and U implies 

that with probability at least 1 — n"^^^*^, property (a) is satisfied for every valid choice of T and U. 

Next, we move on to the more involved part of the proof, which is to prove property (b) in 
Definition [TJ To this end, given any T C [n] with \T\ ^ e and a column i G [n] we will upper 
bound the probability that at least bi tests that contain i are themselves contained in some subset 
in J-'a,n-d/2{T)- It turns out that this probability will be n~'-^^^\ which is not enough to apply the 
union bound over all the (g) choices of T. We then observe that these probabilities are almost 
independent for any il.{e) such columns, which is sufficient for the union bound over all choice of T 
to go through. 

Fix a subset T C [n] with |T| ^ e and a subset S £ J^a,n-d/2{T). (Note that |5| ^ n — d/2.) We 
say that a row j £ [t] avoids S {J^a,n-d/2{T) resp.) if the jth row (which we will denote by M{j)) 
is not a subset of S (any subset in Fa,n-d/2{T) resp.). In other words, if j avoids S then Mj^i = 1 
for some i ^ S. Thus, we have 

Pr [j doesn't avoid S] = 1 - - ^ h _ _ ^ exp , (7) 



ey \ e J \ 2e 

where first inequality follows from the fact that |5| ^ n — d/2. 

Fix a column i £ [t]. Note that if i S" and M{j) contains i, then j does avoid 5. Now, 
i £ S and given that the probability in ([7]) only depends on the indices i ^ S, we get that 
Pr [j contains i and doesn't avoid S] ^ exp(— ci/(2e))/e. Thus, whether i £ S oi not, we have by 
the union bound 

Pr [j contains i and doesn't avoid J^a,n-d/2{T)] ^ — • exp ( — 7^ ) ^ -^7—; (8) 

where the last inequality follows if e ^ ToTnL' 

Now call a row j i-had if it contains i but does not avoid ^a,n-d/2{T). (If it contains i and 
avoids J^a,n-d/2iT), the call it i-good.) Note that we need to show that for at least (1 — 7)^ columns 
i £ T, there are at most bi i-bad rows. Thus, by the expected number of i-bad rows is at most 
t/(80e), or the expected number of z-good rows is at least 79t/(80e). By the Chernoff bound, we 
have 

78t 39t " / t \ 

Pr Number of i-good rows < = ^ exp ( | ^ n~^^^, (9) 

^ 80e 40e ^ V 3 • 79 • 80e / ' ^ ^ 
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where the last inequahty follows from large enough c. Since the expected Hamming weight of any 
column is t/e, the Chernoff bound implies that 



Pr 



81t 

Column i has Hamming weight ^ 



exp 



t 



3 • 802e 



< n 



-190 



where again the last inequality follows for large enough c. Thus, Q and (jlOp imply that 

3t " 



Pr 



Number of z-bad rows > 



80e 



(10) 



(11) 



where the last inequality is true for n ^ 2. Unfortunately, the bound above is too weak to apply 
the union bound over all the (") choices of T. However, we get around this obstacle by proving 
that for any Q{e) values of i G [t], the probabilities above are essentially independent. 

Call a column i had if the number of bad i-rows is at least hi (for some ^ <hi < to 
be fixed later). For notational convenience, define ^ = Next we are going to show that for any 
subset y = {ii, . . . , i^} C T, 

Pr [Every j G F is bad] ^ (12) 

Note that the above implies that the probability that more than i columns in T are bad is upper 
bounded by 



60 



2e 



where the last inequality follows for n ^ 180. Thus, the probability that for some T C [n] with 
|T| ^ e, there are more than I bad columns in T, by the union bound, is upper bounded by n~'^ . 
Thus, property (b) is true with probability at least 1 — n~^. 

To complete the proof, we will prove (I12p . Note that we can rewrite the probability in (|12p as 



Pr 



ii is bad | ^ j is bad 

jev\ic 



■ Y[ [j is bad] < Pr 



ii is bad | ^ j is bad 

jeV\ie 



■ n 



-2e 



where the inequality follows from (|lip and the fact that hi > 3i/(80e). Thus, we will be done if we 
can show 



Pr i£ is bad | ^ j is bad ^ n ^. 
To this end, let B C [t] be the rows that contain at least one column from V \ ig. Note that 



(13) 



B\\B\]=t{ 1-1 



1 



(14) 



By the Chernoff bound, we have 
Pr 



(15) 
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As for any real x > and integer y > with xy < 1, 1 — xy ^ (1 — x)^ ^ 1 — xy + {xy)'^ /2, we have 
59/60 ^ (l - ^ 59/60 + 1/7200 < 119/120. This along with ([l5]) implies that 



Pr 







li^l > 

_ 50_ 


^ exp ^ 



t 



< n 



-190e 



75 • 120^ 

where the last inequality follows for large enough c. Similarly, one can show that 



Pr 



Number of rows in B that contain ii > 



t 

50e 



^ n 



-190 



We do a conservative estimate and assume that all tests in B that contain are Z£-bad. Because of 
the bound above, w.l.o.g. with all but an n~^'^^ probability, we can assume that = t/50. Using 
the same calculatioiiQ as we did to obtain (|lip . we can show that 



Pr 



Number of i^-bad rows in i? > 



3-49t 
4000e 



^ 2-n 



-49 190/50 



< n 



-185 



Adding in the number of rows in B that contain i^, we obtain that 



Pr 



t 147t 

Number of i/-bad rows > 1 

50e 4000e 



j is bad 



180 



Picking hi = ^ + < 62 completes the proof of (fT3]l . Thus, we have completed the proof of 



L _|_ 147 
„ jO 4000 

the existence of the desired (e, e, gg , -7\i,n-d/2)-list disjunct matrix. 

In fact, the proof shows that the required matrices can be computed with high probability. 
However, at least r2(n) random bits are required, which is too high for any data stream application. 
Next we point out that the proof only requires limited independence and hence, we can get away 
with much fewer random bits. In the remainder of the proof, we will think of the bits of M to come 
from some fc-wise independent source that contain bit strings of length tn. 

We now go through the proof above and estimate the amount of independence needed. The 
first place that needs independence is and we claim that 0(logn)-wise independence suffices. 
This follows from the tail bounds for A;- wise independent sources from [6]. In particular, Bellare 
and Rompel show that for a A;-wise independent source, the sum of binary random variables with 

fc/2 



mean /i can have a deviation of strictly more than A with probability at most 8 • ( 



Note 



that in our case /i = n/e, A = n/(2e) and since n/e ^ clog?^, picking a 0(logn)-wise independent 
source works. 

The next places in the proof that use independence are ([5]) and ([6]). It is easy to check that 
the calculations go through if we have 2et-wise independence. Next, independence is used in ([7]). 
Note that in this case we need to upper bound the probability by (1 — l/e)^(^^°§^). Thus, picking 
0(elogL)-wise independence works for this case. Next ([9]) needs t/e-wise independence. This 
follows from the tail bound for A;- wise independence from [6]. Note that we actually need the product 
of the independence used in d?]) and ([9|), that is, we need a total of 0(tlogL)-wise independence. 
For (1101) t-wise independence suffices. Finally for ()14p and ()15l) we need £t-wise independence. In 
fact, again using the bound from [^, we can get away with 0(t)-wise independence. 



''We need to replace t by 49t/50. Further in ((TJ, we need to replace n — [S'l by n — |5| — e/60 as in the worst case 
{ii, . . . C [n] \ 5*. However, this does not change the upper bound in (|8} as long as we pick e ^ d/(151nL). 
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Thus, overall we need 0(t(e + log L))-wise independence. Generally, fc-wise independent sources 
are for unbiased bits where as we need random bits that take a value of one with probabil- 
ity 1/e. However, since we can get such random bits from O(loge) unbiased bits, we will need 
0(t(e + log L) log e)-wise independent sources containing 0(nt log e) bit strings. Using well-known 
construction of /c-wise independent sources, we can get away with R = 0{t{e + log L) log e • log(nt)) 
random bits. This completes the proof as t ^ n. 

6.1 List Disjunct Matrices. 

We begin with the definition of a stronger kind of list disjunct matrices: 

Definition 1. Let n, si, e,i,L ^ 1 be integers with si ^ S2 and Zei ^ 7 ^ 1 be a real. For any 
subset T C [n] such that \T\ ^ e, let Tsi,s2{T) be a collection 0/ forbidden subsets of [n] of size in 
the range [si,S2] such that \Tsi,s2{T)\ ^ L. Atxn binary matrix M is called a (e, ^, 7, Js^^s2)-list 
disjunct matrix if there exist integers ^ 61 < 62 such that the following hold for any T C [n] with 
\T\ ^ e: 

1. For any subset U C [n] such that \U\ ^ £ and U OT = 0, there exists an i ^ U with the 
following property: The number of rows where the ith column of M has a one and all the 
columns in T have a zero is at least 62- 

2. The following holds for at least (1 — 7)6 many i € T: Let Ri denote all the rows of M (thought 
of as subsets of [n]) that contain i. Then \{U G Ri\U C V, for some V G J^s^^s2iT)}\ ^ ^1. 

The definition might appear complicated but it is setup to easily imply Proposition [T71 Further, 
a (e, i, 0, 0)-list disjunct matrix (with 61 = and 62 = 1) is the same as the (e, ^)-list disjunct matrix 
considered in jl8j . Further, an (e, l)-list disjunct matrix is the well-known e-disjunct matrix |13| . 

Let us also define the following error version of group testing that will be relevant to our scenario. 

Definition 2. Let n,si,S2,e,L ^ I be integers. For every T C [n] such that \T\ ^ e, let J^s^^^s^iT) 
be the collection of forbidden subsets as in DefinitionUl Then (e, J-"si,s2)'§i'o^P testing works in the 
following manner: Given a set of defectives T C [n] such that \T\ ^ e, any test U C [n] behaves as 
follows: If U n T = 0, then the test will return an answer of 0. // C/ Pi T 7^ and [/ C V for some 
V G J-"sj^s2(^)> then the test will return an answer o/O. Otherwise the test returns an answer ofl. 

The algorithm A in the below proposition is a natural generalization of the standard decoding 
algorithm for e-disjunct matrices |13j . 

Proposition 17. Let n,e,i, si, S2, L,j,J^si,s2 be as in Definition{l\ Let M be a (e, ^, 7, Js^^sj)" 
list disjunct matrix with t rows. Finally, consider an outcome vector f of applying M to a set of 
defectives E with \E\ ^ e in the {e,J^si,s2)'9^oup testing scenario. Then there exists an algorithm 
A, which given r can compute a set G such that \G\ ^ ^ -|- e — 1 and \E\G\ ^ 76. Further, A uses 
0{t + \ogn + S{t,n)) space, where S{t,n) is the space required to compute any entry of Ad . 

6.2 Proof of Proposition [T71 

The algorithm A is very simple: Go through every column i G [n] and declare z G if and only if 
the number of rows j G [t] where Mj^i = 1 but rj = is at least 62- It is easy to check that A has 
the claimed space requirement. The correctness of A follows from Definitions [1] and [2j To see this 
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note that if |G| ^ e + i, i.e. |G \ -Ej ^ i, then by part (a) of Definition [H there exists an i £ G\E 
with the following property: There are at least 62 rows j G [t] such that Mj^i = 1 but Mj^i' = for 
every i' G E. By Definition [21 for every such j, rj = 0. Thus, by definition of A, i cannot be in 
G. Now consider an i £ E for which property (b) of Definition [T] holds. Now by Definition [21 there 
are at most bi rows j E [t] such that Mj^i = 1 and rj = 0. Since bi < b2, A includes i in G. This 
implies that \E\G\ ^76. 

The space requirement of 0(e^ log^ n) of part (c) is unsatisfactory. Reducing the amount of 
randomness needed to something like O(elogn) will realize the full potential our algorithm. We 
leave this as an open problem. 

7 Limitations of our techniques 

One shortcoming of Theorem [T4l is that to distinguish between (say) at most e and at least 2e errors, 
we needed e ■ s ^ 0{n), where s is the minimum support size of any test. Another shortcoming is 
that we need O(elogn) space. In this section, we prove that our techniques cannot overcome these 
limits. 

We begin with some quick notation. For any k ^ 1, a k^~^ query to a string x G {0, 1}" 
corresponds to a subset S CI [n]. The answer to the query is xs if wt{xs) < k, otherwise the 
answer is k'^'^ (signifying that wt(x5') ^ k). (This is a natural generalization of k~^ decision trees 
considered by Aspnes et al. [5]-) ^ algorithm to solve the (^, t, n)-threshold function makes a 
sequence of A; queries to the input x G {0, 1}", and can tell whether wt(x) ^ ^ or wt(x) ^ t. 
If we think of x as being the indicator vector for error locations, then our reduction from tolerant 
testing to error detection can be thought of as a l"*"*" algorithm for the (e, 0(e))-threshold function. 

First we show that the minimum support size that we obtain in our reduction, even with the 
stronger fc"*""*" primitive, is nearly optimal. 

Theorem 18. Let ^ i < t ^ n and k ^ 1 be integers. Let e < 1/2 be a constant real. Then any 
non-adaptive, randomized k^"^ algorithm for the (i,t,n) -threshold problem with error probability 
at most e, where all the queries have support size at least s, needs to make at least e*^/"/n*^^^^ 
queries. Ln particular, any algorithm that makes a sublinear number of queries needs to satisfy 
s ■ i ^ 0{knlogn). 

7. 1 Proof of Theorem [H] 

Define the following distribution D on inputs in {0, 1}": uniformly distribute a probability mass of 
1 /2 over the (") vectors of Hamming weight exactly £ (call this set J\f) and the rest of the probability 
mass uniformly over the (") vectors of Hamming weight t (call this set 3^) . We will show that any 
deterministic non-adaptive k~^^ algorithm with an error probability at most e (according to T>) 

si 

must make at least ^Jlk) queries. Yao's lemma will then complete the proof. 

Fix an arbitrary k'^^ algorithm A that has error probability at most e. Thus, A outputs the 
correct value of in at least (^yl-e) ^ 1 ~ 2e fraction of elements in M (call this set of elements A/q). 
Similarly, the algorithm outputs the correct value of 1 in at least 1 — 2e fraction of the elements 
in y (call this set yi). Any k~^~^ query is said to cover a pair of inputs {xo,xi) G A/q x 3^i, if it 
outputs different answers for the inputs xq and xi. Note that all the pairs in A/q x have to be 
covered by some query in A. 
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To complete the proof, we will show that at least ^^^1'^ jrpi^) queries are needed to cover A/q x3^i. 
To this end given an arbitrary query Q of support i s,we will bound the number of pairs it can 
cover (call this number of pairs Pq). Note that Q will not cover a pair {xq,xi) if both xq and xi 
have at least k ones in the support of Q. Thus, to upper bound Pq, we will count the number of 
pairs {xq,xi) such that either xq or xi have support < k in Q. This latter count is clearly upper 
bounded by 



max 




where for notational convenience we define (^) = for b > a. We claim that the above is upper 
bounded by (see Appendix |X] for a proof): 



max 



n\ I n — s\ /n — s\/n 

t' j'V f )\t 



where t' = maxo^j<jfc-i{i — j|i — i ^ ?^ — s} and = maxo-gj^A:-i{^ — — j ^ n — s}. The way 
we are going to proceed with the rest of the proof, the maximum in the above will occur for the 
second argument, i.e. from now on, we have that for any query Q, 



As 



PQ^kn^^'-'H'';/)n='Pn.a.. (16) 



\Moxy,\;,ii-2erQr], (17) 



by the pigeonhole principle, the number of queries that A needs to make is at least 

Pma. " ^n3('=-l)(V)(") 

" /fcn4('=-i)("-0 ^ ' 

^8(1 - 2e)2e^^> , , 

> , ^ (20) 

A;n4(fc-i)^27(n+l) ^ ^ 

V8(l-2.)V^/» 

^^4(fe-l)gfc-1^27(7i + l) ^ ^ 

In the above, (jlSp follows from (jl7p and (jl6p . (|19p follows from the following argument. Note 
that if £ < n/2 then Q) ^ (;5). If > n/2, then (^) ^ (^) /n^"^'. Finally if f < n/2 and £ ^ n/2, 
then C^) ^ if \n/2-£\ < \n/2 - i'\ otherwise (^) ^ (p)/nl"/2-^l-l'^/2-^'l. Thus, in all cases, 
(") ^ (P)/"^"^' ^ where the last inequality follows from the fact that i - i' ^ k - 1. 

(j20p follows from Lemma [T9l Finally (j2ip follows from the fact that i' ^ i — k + 1 and s ^ n. 

We are done except for the following lemma: 

Lemma 19. Let a ^ n and b ^ n — a be integers. Then 



ib) > ab/n / ^ 

" Y 27(n + 1) ' 
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Proof. Stirling's approximation can be used to obtain the following bound for n ^ 1, 



V2Trn ( — ] ^ n! ^ VSvrn ^ 

. e 



In particular, this implies that for any x ^ y, 



I I 2y yy ^ (y\ ^ 1 / 3y 



SV Trx{y — x) x^{y — x)y~^ \xj 2\i'Kx{y — x) x^{y — x)v~^ 
Using the bound above, we get 

("-'^) " ^ ^ ' ' ' (n - bY~\n - aY-'^ ' 
where 



/ 8 n(n - a - 6) 

/a, b,n) = J—- — ^ 

y 27 (n — o)(n — a) 



\ n{n—a—b) ' ' 



where the last inequality used the facts that ab ^ and n — a — b 1. 
Now consider the following sequence of relationships 

n \n — a — by ( n \ / n \ / n[n — a — b) ^ 



{n — b)"- ^(n — a)"- " \n — a J \n — b J \{n — a){n — b) 

1 1 1 



n— a— 6 

I ab ' 

n{n—a—b) 



where the inequality follows from the following two facts (for x, y > 0): 



l + ^]%e^ and (1 - ^ e'^^'. 



(22) 



~^ \\i^^rr-ZK^ (23) 



1 1 1 

Q—ab/n Q—ab/n ^ab/n 

^ab/n^ (24) 



([22]), (I23|) and (IMl) complete the proof. ■ 

Note that our reduction maps one tolerant testing problem instance (where say we want to 
distinguish between at most e error vs. at least 2e errors) to O(elogn) many instances of error 
detection. Next we show that this is essentially unavoidable even if we use queries for constant 
k. The following result follows from the results in j5]: 

Theorem 20. Let ^ i < t ^ n and k ^ 1 be integers. Then any adaptive, deterministic k~^~^ 
algorithm for the (i,t,n) -threshold problem makes 0,{£/k) queries. 



25 



7.2 Proof of Theorem [20] 



The proof will be by an adversarial argument to show that if r < (./k k'^'^ queries are made then 
there exist two inputs x and y on which the answers to the queries will be the same, yet wt(x) ^ I 
and Wt(?7) ^ t. Note that the existence of such a pair of inputs will complete our proof. 

We will think of the adversary as maintaining a set of positions U {i) after the ith step. The 
invariance that the adversary will maintain is that U{i — 1) C U{i) and more importantly, that any 
input X such that = 1 will be consistent with answers to the queries. Finally, it is also the 

case that |C/(«)| ^ ki. Note that if we can come up with a way to construct these subsets {U{i)Yi=i, 
then the proof will be done (consider the inputs 1 and a such that du^^^ = 1 and a[n]\(7(r) = 0)- 

To complete the proof, we will show how the adversary can construct the set U{i). Given the 
ith k^'^ query S C [n], the adversary constructs U{i) as follows: Let S' = S\U{i—l). If |5'| ^ k, 
then let U{i) = U{i — 1) \J S. Otherwise pick an arbitrary subset T <Z S' such that \T'\ = k and 
define U{i) = U{i — 1)L) T. In both cases, the adversary answers the query as follows: If \S\ ^ k, 
return an answer of A;"*""*", otherwise report that the substring indexed by S is the all ones vector. 
It is easy to check that U{i) satisfies all the required properties. 



8 Randomness Efficient Construction of List Disjunct Matrices 

In this section, we show that (e, e)-list disjunct matrices can be constructed with t = O(elogn) 
rows (each with support at least n/(2e)) with O(elog^n) random bits. 

For this, we will need Nisan's PRG for space bounded computation [20]. Nisan's result states 
that there exists a function G : {0, 1}"^ — )• {0, l}'^ such that any Finite State Machine that uses 
0{S) space and R random bits, cannot distinguish between truly unbiased random R bits and the 
bits G{x) (for x chosen randomly from {0,1}"^) for T = 0{S log R) with probability more than 
2~'^('S'). Further, any bit of G{x) can be computed, given the T random bits x (and 0{S) extra 
storage) . 

We first use the probabilistic method to show that the required object exists with high prob- 
ability. Then we show that the proof can be implemented in low space and use Nisan's PRG to 
complete the proof. 

Let t = c ■ elogn, where c is some large enough constant so that all calculations go through. 
Also let a ^ 1 be a large enough constant to be determined later. We will also assume that t ^ n 
so that n/e ^ clogn. Let M be a random t x n matrix, where each entry is one independently 
with probability 1/e. Now to prove that M has the required property, we show that it satisfies the 
following two properties with high probability: 

(a) Every row of M has Hamming weight at least 

(b) For any two disjoint subsets 5, T C [n] such that \S\ = \T\ = e, there is at least one row such 
that at least one column in T has a one in it while all the columns in S have a zero in it. 

We begin with (a). Note that in expectation any row has n/e ones in it. Thus, by Chernoff 
bound, the probability that any row has Hamming weight at most n/(2e) is upper bounded by 

where the last inequality follows for c ^ 24a and the first inequality follows from the assumption 
that t ^ n. 



26 



Next, we move to (b). Fix a row j G [t]. Now the probability that Vi^xMj^i = 1 and Vjg^Mj^i 
is exactly 



where the last inequality follows for e ^ 2. Thus, the probability that there does not exist a row 
as desired in part (b) is upper bounded by 

rj\ ce log n 



where the last inequality follows for c ^ 20(2 + a). Thus, by the union bound, part (a) does not 
hold with probability at most (for e ^ 2). 

Thus, M does not have the desired property with probability at most (for n ^ 2). 

Next, we estimate the space required to implement the proof above, i.e. given R = nt bits of 
the entries in M, we need to figure out how much space is needed to verify whether M has the 
required property or not. For part (a), we need 0(logt + logn) space to keep track of the row and 
0(log(n/e)) bits to check if the row has Hamming weight at least n/(2e). So we can implement 
part (a) with O(logn) space. For part (b), we need O(elogn) space to keep track of the subsets S 
and T. For given S and T, we need O(logt) space to keep track of the rows and O(logn) space to 
verify if it is the row that "takes care" of S and T. Thus, for part (b) we need O(elogn) space. 

Thus, overall we have S = O(elogn). We are almost done, except for one small catch: Nisan's 
PRG deals with unbiased bits but we need random bits that are biased. However, we can obtain a 
random bit that is one with probability 1/e from O(loge) unbiased bits (by declaring the final bit 
to be one if and only if all the unbiased bits are 1). Thus, we can convert the proof above to use 
R' = 0(log e • R) unbiased random bits. Further, this conversion needs an extra 0(log log e + log R) 
space, which implies that the total space used is S' = O(elogn). 

Thus, by Nisan's PRG we would be done with 0{S' log R') = O(elog^n) random bits. Us- 
ing Nisan's PRG will increase the error probability to + 2~'~'^^\ which can be made to be 
polynomially small by picking a appropriately. 
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A Upper bounding a sum 

We begin with the sum 




Let s,i,t and n be such that s ^ i ^ n and t ^ n. The sum above is then upper bounded by 




Now define j* to be the minimum ^ j ^ k — 1 such that t — j* ^ n — s (if no such j* exists then 
the sum is 0). Now upper bounding (") ^ n''^^ for j ^ k — 1, we can again upper bound the sum 
above by 




From the bound that (°) ^ a(^^ J , we get that (""|) ^ (n - s)^-^* {^jfl ^ n^^-^ {"^jfl . This along 
with the bound above implies that 

|:G)C)(::;)-"'"-'("r)^ 

where t' = t — j* , as desired. Similarly one can show that 

gG)G)(::;)--"'"-'("r)' 

where £' = maxo^j<gfc_i{£ — j\i — j ^ n — s}. 



30 



